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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Price, Laura A. 

Pausch, Mark H. 

(ii) TITLE OF INVENTION: Potassium Channels, Nucleotide Sequences 

Enc_o_ding_Them,„and^ 



33? r. 



(iii) NUMBER OF SEQUENCES: 56 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: American Home Products Corporation 

(B) STREET: One Campus Drive 

(C) CITY: Parsippany 

(D) STATE : New Jersey 

(E) COUNTRY: USA 

(F) ZIP : 07054 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: ll-MAR-1997 

( C ) C LAS S I F I CAT I ON : 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Matthews, Gale F. 

(B) REGISTRATION NUMBER: 32,269 

(C) REFERENCE /DOCKET NUMBER: 32,421-C2 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-683-2134 

(B) TELEFAX: 201-683-4117 



(2) INFORMATION FOR SEQ ID NO : 1 : 

4 

(i) SEQUENCE CHARACTERISTICS : v 
. (A) LENGTH: 2441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 190.. 2043 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

ACGCGATCGC CGCGAGTGTA TATTTTTTTT TTAGCTCAGT CTTCAGTGTT TCGCGATTCT 60 

CTTTAAAAGA AAAAAAAAAT AATAAGTCAA AACTACAAAC CACACAGCGA AAGGCGAAAG 12 0 

CAACGGTTCC TGCGAGTGTT TATTTTTTTT TTCAACAATT TTTGATCGTA GTGCGACAAT 180 

CCGTCGAGC ATG TCG CCG AAT CGA TGG ATC CTG CTG CTC ATC TTC TAC 22 8 

Met Ser Pro Asn Arg Trp lie Leu Leu Leu lie Phe Tyr 
1 5 10 



ATA TCC TAC CTG ATG TTC GGG GCG GCA ATC TAT TAC CAT ATT GAG CAC 
lie Ser Tyr Leu Met Phe Gly Ala Ala lie Tyr Tyr His lie Glu His 
15 20 25 

GGC GAG GAG AAG ATA TCG CGC GCC GAA CAG CGC AAG GCG CAA ATT GCA 
Gly Glu Glu Lys lie Ser Arg Ala Glu Gin Arg Lys Ala Gin lie Ala 
30 35 40 45 

ATC AAC GAA TAT CTG CTG GAG GAG CTG GGC GAC AAG AAT ACG ACC ACA 
lie Asn Glu Tyr Leu Leu Glu Glu Leu Gl y Asp L ys Asn Thr T hr Thr_ 



~50 ~ ~~ " 55 60 



CAG GAT GAG ATT CTT CAA CGG ATC TCG GAT TAC TGT GAC AAA CCG GTT 
Gin Asp Glu lie Leu Gin Arg He Ser Asp Tyr Cys Asp Lys Pro Val 
65 70 75 

ACA TTG CCG CCG ACA TAT GAT GAT ACG CCC TAC ACG TGG ACC TTC TAC 
Thr Leu Pro Pro Thr Tyr Asp Asp Thr Pro Tyr Thr Trp Thr Phe Tyr 
80 85 90 

CAT GCC TTC TTC TTC GCC TTC ACC GTT TGC TCC ACG GTG GGA TAT GGG 
His Ala Phe Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly 
95 100 105 

AAT ATA TCG CCA ACC ACC TTC GCC GGA CGG ATG ATC ATG ATC GCG TAT 
Asn He Ser Pro Thr Thr Phe Ala Gly Arg Met He Met He Ala Tyr 
110 H5 120 125 

TCG GTG ATT GGC ATC CCC GTC AAT GGT ATC CTC TTT GCC GGC CTC GGC 
Ser Val He Gly He Pro Val Asn Gly He Leu Phe Ala Gly Leu Gly 
130 135 140 

GAA TAC TTT GGA CGT ACG TTT GAA GCG ATC TAC AGA CGC TAC AAA AAG 
Glu Tyr Phe Gly Arg Thr Phe Glu Ala He Tyr Arg Arg Tyr Lys Lys 
145 150 155 

TAC AAG ATG TCC ACG GAT ATG CAC TAT GTC CCG CCG CAG CTG GGA TTG 
Tyr Lys Met Ser Thr Asp Met His Tyr Val Pro Pro Gin Leu Gly Leu 
160 165 170 

ATC ACC ACG GTG GTG ATT GCC CTG ATT CCG GGA ATA GCT CTC TTC CTG 
He Thr Thr Val Val He Ala Leu He Pro Gly He Ala Leu Phe Leu 
175 180 185 

GTG CTG CCC TGC GTG GGT GTT CAC CTA CTT CGA GAA CTG GGC CTA TCT 
Val Leu Pro Cys Val Gly Val His Leu Leu Arg Glu Leu Gly Leu Ser 
190 195 200 205 

TCC ATC TCG CTG TAC TAC AGC TAT GTG ACC ACC ACA ACA ATT GGA TTC 
Ser He Ser Leu Tyr Tyr Ser Tyr Val Thr Thr Thr Thr He Gly Phe 
210 215 220 

GGT GAC TAT GTG CCC ACA TTT GGA GCC AAC CAG CCC AAG GAG TTC GGC 
Gly Asp Tyr Val Pro Thr Phe Gly Ala Asn Gin Pro Lys Glu Phe Gly 
225 230 235 

GGC TGG TTC GTG GTC TAT CAG ATC TTT GTG ATC GTG TGG TTC ATC TTC 
Gly Trp Phe Val Val Tyr Gin He Phe Val He Val Trp Phe He Phe 
240 245 250 

TCG CTG GGA TAT CTT GTG ATG ATC ATG ACA TTT ATC ACT CGG GGC CTC 
Ser Leu Gly Tyr Leu Val Met He Met Thr Phe He Thr Arg Gly Leu 
255 260 265 

CAG AGC AAG AAG CTG GCA TAC CTG GAG CAG CAG TTG TCC TCC AAC CTG 
Gin Ser Lys Lys Leu Ala Tyr Leu Glu Gin Gin Leu Ser Ser Asn Leu 
270 275 280 285 



AAG GCC ACA CAG AAT CGC ATC TGG TCT GGC GTC ACC AAG GAT GTG GGC 



Lys Ala Thr Gin Asn Arg lie Trp Ser Gly Val Thr Lys Asp Val Gly 
290 295 300 



TAC CTC CGG CGA ATG CTC AAC GAG CTG TAC ATC CTC AAA GTG AAG CCT 
Tyr Leu Arg Arg Met Leu Asn Glu Leu Tyr lie Leu Lys Val Lys Pro 
305 310 315 

GTG TAC ACC GAT GTA GAT ATC GCC TAC ACA CTG CCA CGT TCC AAT TCG 
Val Tyr Thr Asp Val Asp lie Ala Tyr Thr Leu Pro Arg Ser Asn Ser 
320 325 330 



TGT CCG GAT CTG AGC ATG TAC CGC GTG GAG CCG GCT CCC ATT CCC AGC 
Cys Pro Asp Leu Ser Met Tyr Arg Val Glu Pro Ala Pro He Pro Ser 
335 340 345 

CGG AAG AGG GCA TTC TCC GTG TGC GCC GAC ATG GTT GGC GCC CAA AGG 
Arcr Lys Arg Ala Phe Ser Val Cys Ala Asp Met Val Gly Ala Gin Arg 
350 355 360 365 

GAG GCG GGC ATG GTA CAC GCC AAT TCC GAT ACG GAT CTA ACC AAA CTG 
Glu Ala Gly Met Val His Ala Asn Ser Asp Thr Asp Leu Thr Lys Leu 
370 375 380 

GAT CGC GAG AAG ACA TTC GAG ACG GCG GAG GCG TAC CAC CAG ACC ACC 
Asp Arg Glu Lys Thr Phe Glu Thr Ala Glu Ala Tyr His Gin Thr Thr 
385 390 395 

GAT TTG CTG GCC AAG GTG GTC AAC GCA CTG GCC ACG GTG AAG CCA CCG 
Asp Leu Leu Ala Lys Val Val Asn Ala Leu Ala Thr Val Lys Pro Pro 
400 405 410 

CCG GCG GAA CAG GAA GAT GCG GCT CTC TAT GGT GGC TAT CAT GGC TTC 
Pro Ala Glu Gin Glu Asp Ala Ala Leu Tyr Gly Gly Tyr His Gly Phe 
415 420 425 

TCC GAC TCC CAG ATC CTG GCC AGC GAA TGG TCG TTC TCG ACG GTC AAC 
Ser Asp Ser Gin He Leu Ala Ser Glu Trp Ser Phe Ser Thr Val Asn 
430 435 440 445 

GAG TTC ACA TCA CCG CGA CGT CCA AGA GCA CGT GCC TGC TCC GAT TTC 
Glu Phe Thr Ser Pro Arg Arg Pro Arg Ala Arg Ala Cys Ser Asp Phe 
450 455 460 

AAT CTG GAG GCA CCT CGC TGG CAG AGC GAG AGG CCA CTG CGT TCG AGC 
Asn Leu Glu Ala Pro Arg Trp Gin Ser Glu Arg Pro Leu Arg Ser Ser 
465 470 475 

CAC AAC GAA TGG ACA TGG AGC GGC GAC AAC CAG CAG ATC CAG GAG GCA 
His Asn Glu Trp Thr Trp Ser Gly Asp Asn Gin Gin He Gin Glu Ala 
480 485 490 

TTC AAC CAG CGC TAC AAG GGA CAG CAG CGT GCC AAC GGA GCA GCC AAC 
Phe Asn Gin Arg Tyr Lys Gly Gin Gin Arg Ala Asn Gly Ala Ala Asn 
495 500 505 

TCG ACC ATG GTC CAT CTG GAG CCG GAT GCT TTG GAG GAG CAG CTG AGA 
Ser Thr Met Val His Leu Glu Pro Asp Ala Leu Glu Glu Gin Leu Arg 
510 515 520 525 

AAC AAT CAC CGG GTG CCG GTC GCG TCA AGA AGT TCT CCA TGC CGG ATG 
Asn Asn His Arg Val Pro Val Ala Ser Arg Ser Ser Pro Cys Arg Met 
530 535 540 

GTC TGC GAC GTC TGT TTC CCT TCC AGA AGA AGC ACC CCT CGC AGG ATC 
Val Cys Asp Val Cys Phe Pro Ser Arg Arg Ser Thr Pro Arg Arg He 
545 550 555 



TGG AGC GCA AGT TGT CCG TGG TCT CGG TAC CCG AGG GTG TCA TCT CGC 
Trp Ser Ala Ser Cys Pro Trp Ser Arg Tyr Pro Arg Val Ser Ser Arg 



s - 



560 565 570 

AGG AAG CCA GAT CCC CGC TGG ACT ACT ACA TCA ACA CGG TCA CGG CGG 1956 
Arg Lys Pro Asp Pro Arg Trp Thr Thr Thr Ser Thr Arg Ser Arg Arg 
575 580 585 

CCT CCA GTC AAT CCT ATT TGC GCA ACG GAC GCG GTC CGC CAC CGC CCT 2 004 

Pro Pro Val Asn Pro lie Cys Ala Thr Asp Ala Val Arg His Arg Pro 
590 595 600 605 



;™ ( 2 ) INFORMATION FOR SEQ ID NO : 2 : 

□ (i) SEQUENCE CHARACTERISTICS: 

^ (A) LENGTH: 618 amino acids 

I t (B) TYPE: amino acid 

^ (D) TOPOLOGY: linear 

O (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 



Met Ser Pro Asn Arg Trp lie Leu Leu Leu lie Phe Tyr lie Ser Tyr 
15 10 15 

Leu Met Phe Gly Ala Ala lie Tyr Tyr His lie Glu His Gly Glu Glu 
20 25 30 

Lys lie Ser Arg Ala Glu Gin Arg Lys Ala Gin lie Ala lie Asn Glu 
35 40 45 

Tyr Leu Leu Glu Glu Leu Gly Asp Lys Asn Thr Thr Thr Gin Asp Glu 
50 55 60 

lie Leu Gin Arg lie Ser Asp Tyr Cys Asp Lys Pro Val Thr Leu Pro 
65 70 75 80 

Pro Thr Tyr Asp Asp Thr Pro Tyr Thr Trp Thr Phe Tyr His Ala Phe 
85 90 95 

Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn lie Ser 
100 105 110 

Pro Thr Thr Phe Ala Gly Arg Met He Met He Ala Tyr Ser .Val He 
115 120 125 

Gly He Pro Val Asn Gly He Leu Phe Ala Gly Leu Gly Glu Tyr Phe 
130 135 140 

Gly Arg Thr Phe Glu Ala He Tyr Arg Arg Tyr Lys Lys Tyr Lys Met 
145 150 155 160 



JTCG AAT_CGA_ATG__GCA_GCT_TGG— CCA— GGG-GCG-GCG GCG^GGC TAACGAAC AT 2 053 

Ser Asn Arg Met Ala Ala Trp Pro Ala Ala Ala Ala Gly 
610 615 



GGGCTTCCAG 


ATGGAGGATG 


GAGCAACCCC 


GCCATCGGCA 


TTGGGCGGTG 


GAGCCTATCA 


2113 


ACGCAAGGCG 


GCTGCTGGCA 


AGCGCCGACG 


CGAGAGCATC 


TACACCCAGA 


ATCAAGCCCC 


2173 


ATCCGCTCGC 


CGGGGCAGCA 


TGTATCCGCC 


GACCGCGCAC 


GCCTTGGCCC 


AGATGCAGAT 


2233 


GCGACGCGGC 


AGCTTGGCAA 


CCAGTGGCTC 


TGGATCGGCG 


GCCATGGCGG 


CAGTGGCCGC 


2293 


GCGTCGTGGC 


AGCCTCTTCC 


CAGCTACAGC 


ATCGGCATCA 


TCGCTGACCT 


CTGCTCCGCG 


2353 


CCGAAGCAGC 


ATATTCTCGG 


TTACCTCCGA 


AAAGGATATG 


AATGTGCTGG 


AGCAGACGAC 


2413 


CATTGCGGAT 


CTGATTCGTG 


CGCTCGAG 








2441 



Ser Thr Asp Met His Tyr Val Pro Pro Gin Leu Gly Leu lie Thr Thr 
165 170 175 

Val Val lie Ala Leu lie Pro Gly lie Ala Leu Phe Leu Val Leu Pro 
180 185 190 

Cys Val Gly Val His Leu Leu Arg Glu Leu Gly Leu Ser Ser lie Ser 
195 200 205 

Leu Tyr T yr S er_ Tyr Va l Thr Thr Thr Thr lie Gly Phe_ Gl y As p Tyr 
210 215 220 

Val Pro Thr Phe Gly Ala Asn Gin Pro Lys Glu Phe Gly Gly Trp Phe 
225 230 235 240 

Val Val Tyr Gin lie Phe Val lie Val Trp Phe lie Phe Ser Leu Gly 
245 250 255 

Tyr Leu Val Met lie Met Thr Phe He Thr Arg Gly Leu Gin Ser Lys 
260 265 270 

Lys Leu Ala Tyr Leu Glu Gin Gin Leu Ser Ser Asn Leu Lys Ala Thr 
275 280 285 

Gin Asn Arg He Trp Ser Gly Val Thr Lys Asp Val Gly Tyr Leu Arg 
290 295 300 

Arg Met Leu Asn Glu Leu Tyr He Leu Lys Val Lys Pro Val Tyr Thr 
305 310 315 320 

Asp Val Asp He Ala Tyr Thr Leu Pro Arg Ser Asn Ser Cys Pro Asp 
325 330 335 

Leu Ser Met Tyr Arg Val Glu Pro Ala Pro He Pro Ser Arg Lys Arg 
340 345 350 

Ala Phe Ser Val Cys Ala Asp Met Val Gly Ala Gin Arg Glu Ala Gly 
355 360 365 

Met Val His Ala Asn Ser Asp Thr Asp Leu Thr Lys Leu Asp Arg Glu 
370 375 380 

Lys Thr Phe Glu Thr Ala Glu Ala Tyr His Gin Thr Thr Asp Leu Leu 
385 390 395 400 

Ala Lys Val Val Asn Ala Leu Ala Thr Val Lys Pro Pro Pro Ala Glu 
405 410 415 

Gin Glu Asp Ala Ala Leu Tyr Gly Gly Tyr His Gly Phe Ser Asp Ser 
420 425 430 

Gin He Leu Ala Ser Glu Trp Ser Phe Ser Thr Val Asn Glu Phe Thr 
435 440 445 

Ser Pro Arg Arg Pro Arg Ala Arg Ala Cys Ser Asp Phe Asn Leu Glu 
450 455 460 

Ala Pro Arg Trp Gin Ser Glu Arg Pro Leu Arg Ser Ser His Asn Glu 
465 470 475 480 

Trp Thr Trp Ser Gly Asp Asn Gin Gin He Gin Glu Ala Phe Asn Gin 
485 490 495 

Arg Tyr Lys Gly Gin Gin Arg Ala Asn Gly Ala Ala Asn Ser Thr Met 
500 505 510 

Val His Leu Glu Pro Asp Ala Leu Glu Glu Gin Leu Arg Asn Asn His 
515 520 525 



Arg Val Pro Val Ala Ser Arg Ser Ser Pro Cys Arg Met Val Cys Asp 
530 535 540 

Val Cys Phe Pro Ser Arg Arg Ser Thr Pro Arg Arg lie Trp Ser Ala 
545 550 555 560 

Ser Cys Pro Trp Ser Arg Tyr Pro Arg Val Ser Ser Arg Arg Lys Pro 
565 570 575 

Asp Pro Arg Trp Thr Thr Thr Ser Thr Arg Ser Arg Arg Pro Pr o Val 

— 580 —5'85—~ 590 



Asn Pro lie Cys Ala Thr Asp Ala Val Arg His Arg Pro Ser Asn Arg 
595 600 605 

Met Ala Ala Trp Pro Ala Ala Ala Ala Gly 
610 615 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : S ingle 

(D) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME / KEY : CDS 

(B) LOCATION: 1..1008 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

ATG TCC GAT CAG CTG TTT GTC GCA TTT GAG AAG TAT TTC TTG ACG AGT 
Met Ser Asp Gin Leu Phe Val Ala Phe Glu Lys Tyr Phe Leu Thr Ser 
15 10 15 

AAC GAG GTC AAG AAG AAT GCA GCA ACG GAG ACA TGG ACA TTT TCA TCG 
Asn Glu Val Lys Lys Asn Ala Ala Thr Glu Thr Trp Thr Phe Ser Ser 
20 25 30 

TCC ATT TTC TTT GCC GTA ACC GTC GTC ACT ACC ATC GGA TAC GGT AAT 
Ser lie Phe Phe Ala Val Thr Val Val Thr Thr lie Gly Tyr Gly Asn 
35 40 45 

CCA GTT CCA GTG ACA AAC ATT GGA CGG ATA TGG TGT ATA TTG TTC TCC 
Pro Val Pro Val Thr Asn lie Gly Arg lie Trp Cys lie Leu Phe Ser 
50 55 60 

TTG CTT GGA ATA CCT CTA ACA CTG GTT ACC ATC GCT GAC TTG GCA GGT 
Leu Leu Gly lie Pro Leu Thr Leu Val Thr lie Ala Asp Leu Ala Gly 
65 70 75 80 

AAA TTC CTA TCT GAA CAT CTT GTT TGG TTG TAT GGA AAC TAT TTG AAA 
Lys Phe Leu Ser Glu, His Leu Val Trp Leu Tyr Gly Asn Tyr Leu Lys 
85 90 95 

TTA AAA TAT CTC ATA TTG TCA CGA CAT CGA AAA GAA CGG AGA GAG CAC 
Leu Lys Tyr Leu lie Leu Ser Arg His Arg Lys Glu Arg Arg Glu His 
100 105 110 

GTT TGT GAG CAC TGT CAC AGT CAT GGA ATG GGG CAT GAT ATG AAT ATC 
Val Cys Glu His Cys His Ser His Gly Met Gly His Asp Met Asn lie 
115 120 125 



GAG GAG AAA AGA ATT CCT GCA TTC CTG GTA TTA GCT ATT CTG ATA GTA 
Glu Glu Lys Arg lie Pro Ala Phe Leu Val Leu Ala lie Leu lie Val 



# 



130 135 140 

TAT ACA GCG TTT GGC GGT GTC CTA ATG TCA AAA TTA GAG CCG TGG TCT 4 80 

Tyr Thr Ala Phe Gly Gly Val Leu Met Ser Lys Leu Glu Pro Trp Ser 
145 150 155 160 

TTC TTC ACT TCA TTC TAC TGG TCC TTC ATT ACA ATG ACT ACT GTC GGG 528 
Phe Phe Thr Ser Phe Tyr Trp Ser Phe lie Thr Met Thr Thr Val Gly 
165 170 175 



TTT - GGC - GAC - TTG - ATG CCC~AGA AGG GAC GGA TAC ATG TAT ATC ATA TTG 576 

Phe Gly Asp Leu Met Pro Arg Arg Asp Gly Tyr Met Tyr lie lie Leu 
180 185 190 

CTC TAT ATC ATT TTA GGT AAA TTT TCA ATG AAA AAA AAA CAA AAA TTC 624 
Leu Tyr He lie Leu Gly Lys Phe Ser Met Lys Lys Lys Gin Lys Phe 
195 200 205 

AAA ATA TTT TTA GGT CTT GCA ATA ACT ACA ATG TGC ATT GAT TTG GTA 672 
Lys He Phe Leu Gly Leu Ala He Thr Thr Met Cys He Asp Leu Val 
210 215 220 

GGA GTA CAG TAT ATT CGA AAG ATT CAT TAT TTC GGA AGA AAA ATT CAA 72 0 

Gly Val Gin Tyr He Arg Lys He His Tyr Phe Gly Arg Lys He Gin 
225 230 235 240 

GAC GCT AGA TCT GCA TTG GCG GTT GTA GGA GGA AAG GTA GTC CTT GTA 768 
Asp Ala Arg Ser Ala Leu Ala Val Val Gly Gly Lys Val Val Leu Val 
245 250 255 

ffi TCA GAA CTC TAC GCA AAT TTA ATG CAA AAG CGA GCT CGT AAC ATG TCC 816 

Ser Glu Leu Tyr Ala Asn Leu Met Gin Lys Arg Ala Arg Asn Met Ser 
260 265 270 

CGA GAA GCT TTT ATA GTG GAG AAT CTC TAT GTT TCC AAA CAC ATC ATA 864 
Arg Glu Ala Phe He Val Glu Asn Leu Tyr Val Ser Lys His He He 
275 280 285 

CCA TTC ATA CCA ACT GAT ATC CGA TGT ATT CGA TAT ATT GAT CAA ACT 912 
Pro Phe He Pro Thr Asp He Arg Cys He Arg Tyr He Asp Gin Thr 
290 295 300 

GCC GAT GCT GCT ACC ATT TCC ACG TCA TCG TCT GCA ATT GAT ATG CAA 960 
Ala Asp Ala Ala Thr He Ser Thr Ser Ser Ser Ala He Asp Met Gin 
305 310 315 320 

AGT TGT AGA TTT TGT CAT TCA AGA TAT TCT CTC AAT CGT GCA TTC AAA 1008 
Ser Cys Arg Phe Cys His Ser Arg Tyr Ser Leu Asn Arg Ala Phe Lys 
325 330 335 



TAG 



1011 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Met Ser Asp Gin Leu Phe Val Ala Phe Glu Lys Tyr Phe Leu Thr Ser 
15 10 15 



Asn Glu Val Lys Lys Asn Ala Ala Thr Glu Thr Trp Thr Phe Ser Ser 
20 25 30 




• 



Ser lie Phe Phe Ala Val Thr Val Val Thr Thr lie Gly Tyr Gly Asn 
35 40 45 

Pro Val Pro Val Thr Asn lie Gly Arg lie Trp Cys lie Leu Phe Ser 
50 55 60 

Leu Leu Gly lie Pro Leu Thr Leu Val Thr lie Ala Asp Leu Ala Gly 
65 70 75 80 

Lys Phe Leu Ser Glu His Leu Val T rp L eu Tyr G ly Asn T yr Leu L ys 

"85 "90 95 

Leu Lys Tyr Leu lie Leu Ser Arg His Arg Lys Glu Arg Arg Glu His 
100 105 110 

Val Cys Glu His Cys His Ser His Gly Met Gly His Asp Met Asn lie 
115 120 125 

Glu Glu Lys Arg lie Pro Ala Phe Leu Val Leu Ala lie Leu lie Val 
130 135 140 

Tyr Thr Ala Phe Gly Gly Val Leu Met Ser Lys Leu Glu Pro Trp Ser 
145 150 155 160 

Phe Phe Thr Ser Phe Tyr Trp Ser Phe lie Thr Met Thr Thr Val Gly 
165 170 175 

Phe Gly Asp Leu Met Pro Arg Arg Asp Gly Tyr Met Tyr lie He Leu 
180 185 190 

Leu Tyr He He Leu Gly Lys Phe Ser Met Lys Lys Lys Gin Lys Phe 
195 200 205 

Lys He Phe Leu Gly Leu Ala He Thr Thr Met Cys He Asp Leu Val 
210 215 220 

Gly Val Gin Tyr He Arg Lys He His Tyr Phe Gly Arg Lys He Gin 
225 230 235 240 

Asp Ala Arg Ser Ala Leu Ala Val Val Gly Gly Lys Val Val Leu Val 
245 250 255 

Ser Glu Leu Tyr Ala Asn Leu Met Gin Lys Arg Ala Arg Asn Met Ser 
260 265 270 

Arg Glu Ala Phe He Val Glu Asn Leu Tyr Val Ser Lys His He He 
275 280 285 

Pro Phe He Pro Thr Asp He Arg Cys He Arg Tyr He Asp Gin Thr 
290 295 300 

Ala Asp Ala Ala Thr He Ser Thr Ser Ser Ser Ala He Asp Met Gin 
305 310 , 315 320 

Ser Cys Arg Phe Cys His Ser Arg Tyr Ser Leu Asn Arg Ala Phe Lys 



325 



330 



335 



(2) INFORMATION FOR SEQ ID NO : 5 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TCCATTTTCT TTGCCGTAAC CGTCGTCACT ACCATCGGAT ACGGTAATCC A 
(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES S : sin gle 

(D) TOPOLOGY^ linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
TCATTCTACT GGTCCTTCAT TACAATGACT ACTGTCGGGT TTGGCGACTT G 
(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Ala Phe Leu Phe Ser He Glu Thr Gin Thr Thr He Gly Tyr Gly Phe 
15 10 15 

Arg Cys Val Thr Asp Glu Cys Pro 
20 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Ala Phe Leu Phe Ser Leu Glu Thr Gin Val Thr He Gly Tyr Gly Phe 
15 10 15 

Arg Cys Val Thr Glu Gin Cys Ala 
20 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 




Ala Phe Leu Phe Phe lie Glu Thr Glu Ala Thr lie Gly Tyr Gly Tyr 
15 10 15 

Arg Tyr lie Thr Asp His Cys Pro 
20 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENG TH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Ala Phe Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn 
15 10 15 

lie Ser Pro Thr Thr Phe Ala Gly 
20 

O (2) INFORMATION FOR SEQ ID NO: 11: 

m 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 
M= (B) TYPE: amino acid 

yf£ (C) STRANDEDNESS: single 

HI (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Ala Phe Trp Trp Ala Val Val Thr Met Thr Thr Val Gly Tyr Gly Asp 
15 10 15 

Met Thr Pro Val Gly Phe Trp Gly 
20 

(2) INFORMATION FOR SEQ ID NO : 12 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

Ala Phe Trp Tyr Thr lie Val Thr Met Thr Thr Leu Gly Tyr Gly Asp 
15 10 15 

Met Val Pro Glu Thr lie Ala Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 



# 



# 



(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Ala Phe Trp Trp Ala Gly lie Thr Met Thr Thr Val Gly Tyr Gly Asp 
1 5 10 15 



— - — ~i'l~e~Cys Pro Thr Thr Ala Leu Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 14 : 

Gly Leu Trp Trp Ala Leu Val Thr Met Thr Thr Val Gly Tyr Gly Asp 
15 10 15 

Met Ala Pro Lys Thr Tyr lie Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Ala Leu Tyr Phe Thr Met Thr Cys Met Thr Ser Val Gly Phe Gly Asn 
15 10 15 

Val Ala Ala Glu Thr Asp Asn Glu 
20 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 16 : 

Cys Val Tyr Phe Leu lie Val Thr Met Ser Thr Val Gly Tyr Gly Asp 
15 10 15 

Val Tyr Cys Glu Thr Val Leu Gly 
20 




(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

Ser Leu Tyr Thr Ser Tyr Val Thr Thr Thr Thr lie Gly Phe Gly Asp 
15 10 15 

Tyr Val Pro Thr Phe Gly Ala Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 
Li Ala Phe Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn 

L 1 5 10 15 

B He Ser Pro Thr Thr Phe Ala Gly 

U 20 

1 5 

(2) INFORMATION FOR SEQ ID NO: 19: 

^ (i) SEQUENCE CHARACTERISTICS: 

ip (A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ser He Phe Phe Ala Val Thr Val Val Thr Thr He Gly Tyr Gly Asn 
15 10 15 

Pro Val Pro Val Thr Asn Thr Gly 
20 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: * 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 



f 




Ser Leu Tyr Thr Ser Tyr Val Thr Thr Thr Thr lie Gly Phe Gly Asp 
15 10 15 

Tyr Val Pro Thr Phe Gly Ala Asn 
20 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 24 __amino_ac ids ^ 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Ser Phe Tyr Trp Ser Phe lie Thr Met Thr Thr Val Gly Phe Gly Asp 
15 10 15 

Leu Met Pro Arg Arg Asp Gly Tyr 
20 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

3 ; 6 

ATAAAGCTTA AAAATGTCGC CGAATCGATG GAT 33 
^ (2) INFORMATION FOR SEQ ID NO: 23: 

'■%A (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 
AGCTCTAGAC CTCCATCTGG AAGCCCATGT 3 0 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
AAAAAGCTTA AAATGGCACA CATCACG 



27 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
AAACTCGAGT CATACCTGTG GACT 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



CO (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26 

f7 AAAAAGCTTA AAATGGTCGG GCAATTG 

feres: 

GH (2) INFORMATION FOR SEQ ID NO: 27: 

%2 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

s (C) STRANDEDNESS: single 

□ (D) TOPOLOGY: linear 

few 

rJQ • (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27 

"* AAAAGCATGC TCATCTGGAT GGGCA 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 
AAAAAGCTTA AAATGGCCTC GGTCGCC 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
TTTTCTAGAC TACATCGTTG TCTT 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 

AAAAAGCTTA AAATGAATCT GATCAAC 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
^ (B) TYPE: nucleic acid 

H (C) STRANDEDNESS: single 

W (D) TOPOLOGY: linear 



O (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

AAATCTAGAT TAGTCGAAAC TGAA 

* (2) INFORMATION FOR SEQ ID NO: 32: 

Ly (i) SEQUENCE CHARACTERISTICS: 

l! (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
^ (C) STRANDEDNESS: single 

Cl (D) TOPOLOGY: linear 

SI 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32 
AAAAAGCTTA AAATGCCTGG CGGA 
(2) INFORMATION FOR SEQ ID NO: 33: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

AAATCTAGAG GCTACAGGAA GTCC 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 7 base pairs 



(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
GGGGGTACCA AAATGTCGGG GTGTGAT — — — — -2-7 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TTTTTCTAGA TCAAGAGTTA TCATC 2 5 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 

ATGGTAATAA TCAACCGATC GAACAC CTAT GCCGTTGAGC AGGAAGCATT TCCAAGAGAC 6 0 

AAGTACAATA TTGTCTACTG GCTCGTCATT CTTGTTGGAT TCGGAGTTCT TCTGCCATGG 12 0 

AATATGTTCA TTACTATCGC CCCTGAGTAT TATGTGAATT ATTGGTTCAA ACCGGATGGC 18 0 

GTGGAGACAT GGTATTCGAA AGAATTCATG GGATCTTTGA CGATTGGCTC ACAACTTCCA 240 

AACGCAAGCA TTAATGTTTT CAACCTGTTC CTCATTATTG CTGGTCCCCT GATCTACCGC 300 

GTCTTTGCTC CGGTTTGCTT CAACATCGTC AACCTGACAA TCATTCTCAT CCTCGTCATT 36 0 

GTTCTGGAGC CCACTGAAGA TTCCATGTCC TGGTTTTTCT GGGTAACTCT TGGAATGGCG 42 0 

ACTTCAATCA ATTTTAGCAA TGGGCTATAT GAAAACTCGG TTTATGGAGT TGGTGGCGAT 48 0 

TTTCCGCACA CCTACATTGG CGCTCTCTTG ATTGGAAACA ACATTTGCGG ATTGCTGATA 540 

ACGGT TGTG A AAATCGGAGT GACCTATTTT CTGAATGATG AGCCTAAACT TGTTGCAATC 600 

GTCTATTTCG GCATATCGTT GGTGATCCTT CTGGTGTGTG CAATTGCACT TTTCTTTATC 660 

ACAAAGCAAG ATTTCTACCA CTATCACCAT CAAAAAGGAA TGGAAATTCG CGAAAAGGCG 72 0 

GAAACCGACA GACCGTCTCC ATCCATTCTT TGGAC C AC AT TCACAAACTG TTATGGGCAA 780 

CTCTTCAATG TTTGGTTCTG CTTTGCCGTT ACTCTCACAA TCTTCCCTGT TATGATGACC 84 0 

GTTACCACTC GTGGAGATTC CGGCTTCCTA AACAAAATTA TGTCTGAAAA CGATGAAATC 900 



T AC AC TTTGC TCACAAGTTT CCTCGTCTTC AATTTGTTCG CTGCGATTGG ATCCATAGTT 960 

GCTTCCAAGA TTCACTGGCC GACACCCCGT TACCTCAAAT TTGCCATAAT CTTGCGTGCT 1020 

CTTTTCATTC CATTCTTCTT CTTCTGCAAC TATCGTGTCC AGACGCGTGC TTATCCTGTT 108 0 

TTCTTTGAGT CT AC TGACAT TTTTGTGATT GGTGGAATTG CCATGTCTTT TTCACATGGA 114 0 

TACCTCAGCG CTCTGGCAAT GGGATACACT CCAAACGTCG TGCCATCTCA CTACTCAAGA 1200 



TTTGCCGCTC AGCTTTCCGT TTGCACTCTT ATGGTTGGCC TTCTCACCGG TGGCCTGTGG 1260 

CCCGTTGTTA TTGAGCACTT CGTGGACAAG CCAAGTATCT TATAAATATT TATAGCATTA 1320 

GAGTATACTT GTTATATGTT GTTTTTATTA AGCTGTGGAA TAAAATAATT ATTAAAAAAA 138 0 

AAAAAAAAAA AAAA 1394 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 



Met Ser Pro Asn Arg Trp lie Leu Leu Leu lie Phe Tyr lie Ser Tyr 
15 10 15 

Leu Met Phe Gly Ala Ala lie Tyr Tyr His lie Glu His Gly Glu Glu 
20 25 30 

Lys He Ser Arg Ala Glu Gin Arg Lys Ala Gin He Ala He Asn Glu 
35 40 45 

Tyr Leu Leu Glu Glu Leu Gly Asp Lys Asn Thr Thr Thr Gin Asp Glu 
50 55 60 

He Leu Gin Arg He Ser Asp Tyr Cys Asp Lys Pro Val Thr Leu Pro 
65 70 75 80 

Pro Thr Tyr Asp Asp Thr Pro Tyr Thr Trp Thr Phe Tyr His Ala Phe 
85 90 95 

Phe Phe Ala Phe Thr Val Cys Ser Thr Val Gly Tyr Gly Asn He Ser 
100 105 110 

Pro Thr Thr Phe Ala Gly Arg Met He Met He Ala Tyr Ser Val He 
115 120 125 

Gly He Pro Val Asn Gly He Leu Phe Ala Gly Leu Gly Glu Tyr Phe 
130 135 140 

Gly Arg Thr Phe Glu Ala He Tyr Arg Arg Tyr Lys Lys Tyr Lys Met 
145 150 155 160 

Ser Thr Asp Met His Tyr Val Pro Pro Gin Leu Gly Leu He Thr Thr 
165 170 175 

Val Val He Ala Leu He Pro Gly He Ala Leu Phe Leu Val Leu Pro 
180 185 190 

Cys Val Gly Val His Leu Leu Arg Glu Leu Gly Leu Ser Ser He Ser 



195 200 205 

Leu Tyr Tyr Ser Tyr Val Thr lie Thr Thr lie Gly Phe Gly Asp Tyr 
210 215 220 

Val Pro Thr Phe Gly Ala Asn Gin Pro Lys Glu Phe Gly Gly Trp Phe 
225 230 235 240 

Val Val Tyr Gin lie Phe Val lie Val Trp Phe lie Phe Ser Leu Gly 

245 2 50 255 



Tyr Leu Val Met lie Met Thr Phe lie Thr Arg Gly Leu Gin Ser Lys 
260 265 270 

Lys Leu Ala Tyr Leu Glu Gin Gin Leu Ser Ser Asn Leu Lys Ala Thr 
275 280 285 

Gin Asn Arg lie Trp Ser Gly Val Thr Lys Asp Val Gly Tyr Leu Arg 
290 295 300 

Arg Met Leu Asn Glu Leu Tyr lie Leu Lys Val Lys Pro Val Tyr Thr 
305 310 315 320 

Asp Val Asp lie Ala Tyr Thr Leu Pro Arg Ser Asn Ser Pro Leu Ser 
325 330 335 

Met Tyr Arg Val Glu Pro Ala Pro lie Pro Ser Arg Lys Arg Ala Phe 
340 345 350 

Ser Val Cys Ala Asp Met Val Gly Ala Gin Arg Glu Ala Gly Met Val 
355 360 365 

His Ala Asn Ser Asp Thr Asp Leu Thr Lys Leu Asp Arg Glu Lys Thr 
370 375 380 

Phe Glu Thr Ala Glu Ala Tyr His Gin Thr Thr Asp Leu Leu Ala Lys 
385 390 395 400 

Val Val Asn Ala Leu Ala Thr Val Lys Pro Pro Pro Ala Leu Gin Glu 
405 410 415 

Asp Ala Ala Leu Tyr Gly Gly Tyr His Gly Phe Ser Asp Ser Gin lie 
420 425 430 

Leu Ala Ser Glu Trp Ser Phe Ser Thr Val Asn Glu Phe Thr Ser Pro 
435 440 445 

Arg Arg Pro Arg Ala Arg Ala Cys Ser Asp Phe Asn Leu Glu Ala Pro 
450 455 460 

Arg Trp Gin Ser Glu Arg Pro Leu Arg Ser Ser His Asn Glu Trp 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 33 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 

Met Ser Asp Gin Leu Phe Val Ala Phe Glu Lys Tyr Phe Leu Thr Ser 
15 10 15 



Asn Glu Val Lys Lys Asn Ala Ala Thr Glu Thr Trp Thr Phe Ser Ser 
20 25 30 

Ser lie Phe Phe Ala Val Thr Val Val Thr Thr lie Gly Tyr Gly Asn 
35 40 45 

Pro Val Pro Val Thr Asn lie Gly Arg lie Trp lie Leu Phe Ser Leu 
50 55 60 



lie Gly lie Pro Leu Thr Leu Va l Thr_Ile_Ala_Leu_Ala_Gly_Lys-Phe- 
65 70 75 80 

Leu Ser Glu His Leu Val Trp Leu Tyr Gly Asn Tyr Leu Lys Leu Lys 
85 90 95 

Tyr Leu lie Leu Ser Arg His Arg Lys Glu Arg Arg Glu His Val Cys 
100 105 110 

Glu His Cys His Ser His Gly Met Gly His Asp Met Asn lie Glu Glu 
115 120 125 

Lys Arg lie Pro Ala Phe Leu Val Leu Ala lie Leu lie Val Tyr Thr 
130 135 140 

Ala Phe Gly Gly Val Leu Met Ser Lys Leu Glu Pro Trp Ser Phe Phe 
145 150 155 160 

Thr Ser Phe Tyr Trp Ser Phe lie Thr Met Thr Thr Val Gly Phe Gly 
165 170 175 

Asp Leu Met Pro Arg Arg Asp Gly Tyr Met Tyr lie lie Leu Leu Tyr 
180 185 190 

lie lie Leu Gly Lys Phe Ser Met Lys Lys Lys ' Gin Lys Phe Lys lie 
195 200 205 

Phe Leu Gly Leu Ala lie Thr Thr Met Cys lie Asp Leu Val Gly Val 
210 215 220 

Gin Tyr lie Arg Lys lie His Tyr Phe Gly Arg Lys lie Gin Asp Ala 
225 230 235 240 

Arg Ser Ala Leu Ala Val Val Gly Gly Lys Val Val Leu Val Ser Glu 
245 250 255 

Leu Tyr Ala Asn Leu Met Gin Lys Arg Ala Arg Asn Met Ser Arg Glu 
260 265 270 

Ala Phe lie Val Glu Asn Leu Tyr Val Ser Lys His lie lie Pro Phe 
275 280 285 

lie Pro Thr Asp lie Arg Cys lie Arg Tyr lie Asp Gin Thr Ala Asp 
290 295 300 

Ala Ala Thr lie Ser Thr Ser Ser Ser Ala lie Asp Met Gin Ser Cys 
305 310 315 320 

Arg Phe Cys His Ser Arg Tyr Ser Leu Asn Arg Ala Phe Lys Xaa 
325 330 335 

INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



• 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39 

TNGGATATCT GGATGACTAT T 

(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 9 base_pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



o 

m 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40 
AGTCATCCAG ATAACTC C AG TACTAGTGT 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41 
CGCAGGCAGA GCCACAAAGA GTACACAG 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 
GGAGATCAGC TAGGCACCAT ATTTGG 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43 
ATGCTGCATG CCTCATGCTT CCCAGC 



• 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
GGTTATTTAA AGAGAGGGCT 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Leu Pro Ser Ala Ser Arg Glu Arg Pro Gly Tyr Arg Ala Gly Val 
15 10 15 

Ala Ala Pro Asp Leu Leu Asp Pro Lys Ser Ala Ala Gin Asn Ser Lys 
20 25 30 

Pro Arg Leu Ser Phe Ser Thr Lys Pro Thr Val Leu Ala Ser Arg Val 
35 40 45 

Glu Ser Asp Thr Thr lie Asn Val Met Lys Trp Lys Thr Val Ser Thr 
50 55 60 

lie Phe Leu Val Val Val Leu Tyr Leu lie lie Gly Ala Thr Val Phe 
65 70 75 80 

Lys Ala Leu Glu Gin Pro His Glu lie Ser Gin Arg Thr Thr lie Val 
85 90 95 

lie Gin Lys Gin Thr Phe lie Ser Gin His Ser Cys Val Asn Ser Thr 
100 105 110 

Glu Leu Asp Glu Leu lie Gin Gin lie Val Ala Ala lie Asn Ala Gly 
115 120 125 

lie lie Pro Leu Gly Asn Thr Ser Asn Gin lie Ser His Trp Asp Leu 
130 135 140 

Gly Ser Ser Phe Phe Phe Ala Gly Thr Val lie Thr Thr lie Gly Phe 
145 150 155 160 

Gly Asn lie Ser Pro Arg Thr Glu Gly Gly Lys lie Phe Cys lie lie 
165 170 175 

Tyr Ala Leu Leu Gly lie Pro Leu Phe Gly Phe Leu Leu Ala Gly Val 
180 185 190 

Gly Asp Gin Leu Gly Thr lie Phe Gly Lys Gly lie Ala Lys Val Glu 
195 200 205 



Asp Thr Phe lie Lys Trp Asn Val Ser Gin Thr Lys lie Arg lie lie 
210 215 220 

Ser Thr lie lie Phe lie Leu Phe Gly Cys Val Leu Phe Val Ala Leu 
225 230 235 240 

Pro Ala lie lie Phe Lys His lie Glu Gly Trp Ser Ala Leu Asp Ala 
245 250 255 



lle_T_yr._Phe_ Val_Val— Ile_ Thr— Leu- Thr— ^ Thr— I-l-e-G-l-y— Phe-Gly— Asp— Tyr- 

260 265 270 

Val Ala Gly Gly Ser Asp lie Glu Tyr Leu Asp Phe Tyr Lys Pro Val 
275 280 285 

Val Trp Phe Trp lie Leu Val Gly Leu Ala Tyr Phe Ala Ala Val Leu 
290 295 300 

Ser Met lie Gly Arg Leu Val Arg Val lie Ser Lys Lys Thr Lys Glu 
305 310 315 320 

Glu Val Gly Glu Phe Arg Ala His Ala Ala Glu Trp Thr Ala Asn Val 
325 330 335 

_ Thr Ala Glu Phe Lys Glu Thr Arg Arg Arg Leu Ser Val Glu lie Tyr 

U 340 345 350 

SJUi 

gO Asp Lys Phe Gin Arg Ala Thr Ser lie Lys Arg Lys Leu Ser Ala Glu 

H 355 360 365 

Leu Ala Gly Asn His Asn Gin Glu Leu Thr Pro Cys Arg Arg Thr Leu 
Q 370 375 380 



Ser Val Asn His Leu Thr Ser Glu Arg Asp Val Leu Pro Pro Leu Leu 
385 390 395 400 

Lys Thr Glu Ser lie Tyr Leu Asn Gly Leu Ala Pro His Cys Ala Gly 
405 410 415 

Glu Glu lie Ala Val lie Glu Asn lie Lys 
420 425 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 0 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: 

CCATCCTAAT ACGACTCACT ATAGGGCTCG AGCGNCCGCC CGGGCAGTAA AATGCCTGCC 6 0 

CGTGCAGCTC GGAGCGCGCA GCCCGTCTCT GAATAAGAAG TGAGTACAAT GGCGTGTTTG 12 0 

TAAAAAAAAG CTTCAAGTCC GTCTTTTTCA AAAAACATTT TGAATGCTGC ATGCCTCATG 180 

CTTCCCAGCG CCTCGCGGGA GAGACCCGGC TATAGAGCAG GAGTGGCGGC ACCTGACTTG 24 0 

CTGGATCCTA AATCTGCCGC TCAGAACTCC AAACCGAGGC TCTCATTTTC CACGAAACCC 3 00 

ACAGTGCTTG CTTCCCGGGT GGAGAGTGAC ACGACCATTA ATGTTATGAA ATGGAAGACG 360 

GTCTCCACGA TATTCCTGGT GGTTGTCCTC TATCTGATCA TCGGAGCCAC CGTGTTCAAA 420 



GCATTGGAGC AGCCTCATGA GATTTCACAG AGGACCACCA TTGTGATCCA GAAGCAAACA 48 0 

TTCATATCCC AACATTCCTG TGTCAATTCG ACGGAGCTGG ATGAACTCAT TCAGCAAATA 54 0 

GTGGCAGCAA TAAATGCAGG GATTAT AC CG TTAGGAAACA CCTCCAATCA AATCAGTCAC 600 

TGGGATTTGG GAAGTTCCTT CTTCTTTGCT GGCACTGTTA TTACAACCAT AGGATTTGGA 66 0 

AACATCTCAC CACGCACAGA AGGCGGCAAA ATATTCTGTA TCATCTATGC CTTACTGGGA 72 0 



ATTCCCCTCT TTGGTTTTCT CTTGGCTGGA GTTGGAGATC AGCTAGGCAC CATATTTGGA 78 0 

AAAGGAATTG C C AAAGTGG A AGATACGTTT ATTAAGTGGA ATGTTAGTCA GACCAAGATT 84 0 

CGCATCATCT CAACAATCAT ATTTATACTA TTTGGCTGTG TACTCTTTGT GGCTCTGCCT 90 0 

GCGATCATAT TCAAACACAT AGAAGGC TGG AGTGCCCTGG ACGCCATTTA TTTTGTGGTT 960 

ATCACTCTAA CAACTATTGG ATTTGGTGAC TACGTTGCAG GTGGATCCGA TATTGAATAT 102 0 

CTGGACTTCT ATAAGCCTGT CGTGTGGTTC TGGATCCTTG TAGGGCTTGC TTACTTTGCT 1080 

GCTGTCCTGA GCATGATTGG GAGATTGGTC CGAGTGATAT CTAAAAAGAC AAAAGAAGAG 1140 

GTGGGAGAGT TCAGAGCACA CGCTGCTGAG TGGACAGCCA ACGTCACAGC CGAATTCAAA 12 00 

CO GAAACCAGGA GGCGACTGAG TGTGGAGATT TATGACAAGT TCCAGCGGGC CACCTCCATC 1260 

rT AAGCGGAAGC TCTCGGCAGA ACTGGCTGGA AACCACAATC AGGAGCTGAC TCCTTGTAGG 132 0 

2 AGGACCCTGT CAGTGAACCA CCTGACCAGC GAGAGGGATG TCTTGCCTCC CTTACTGAAG 1380 

M= ACTGAGAGTA TCTATCTGAA TGGTTTGGCG CCACACTGTG CTGGTGAAGA GATTGCTGTG 144 0 

^~ ATTGAGAACA TCAAATAGCC CTCTCTTTAA ATAACCTTAG GCATAGCCAT AGGTGAGGAC 1500 

O TTCTCTATGC TCTTTATGAC TGTTGCTGGT, AGCATTTTTT AAATTGTGCA TGAGCTCAAA 1560 

Lai 

La GGGGGAACAA AATAGATACA CCCATCATGG TCATCTATCA TCAAGAGAAT TTGGAATTCT 162 0 

H GAGCCAGCAC TTTCTTTCTG ATGATGCTTG TTGAACGGCC CACTTTCTTT GATGAGTGGA 1680 

SI ATGACAAGCA ATGTCTGATG CCTTTGTGTG CCCAGACTGT TTTCCTCTCT CTTTCCCTAA 174 0 

TGTGCCATAA GGCCTCAGAA TGAATTGAGA ATTGTTTCTG GTAACAATGT AGCTTTGAGG 1800 

GATCAGTTCT TAACTTTTCA GGGTCTACCT AACTGAGCCT AGATATGGAC CATTTATGGA 1860 

TGACAACAAT TTTTTTTTTG TAAATGACAA GAAATTCTTA TGCAGCCTTT TACCTAAGAA 192 0 

ATTTCTGTCA GTGCCTTATC TTATGAAGAA ACAGAACCTC TCTAGCTAAT GTGTGGTTTC 198 0 

TCCTTCCCTG CCCCCACCCC TAGGCTCACC TCTGCAGTCT TTTACCCCAG TTCTCCCATT 2 04 0 

TGAATACCAT ACCTTGNTGG AAACAGNGTG TAAAATG AC T GAAGTGATGA TGCCGAAGAT 210 0 

GAAATAGATG NCAAATTAGN TGGACATTGA 213 0 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 
AAAAGATCTA AAATGCTTCC CAGCGCC 2 7 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single , — 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
AAAGTCGACC TATTTGATGT TCTCAAT 2 7 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



|lI (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

^ AAAAAGCTTA AAATGCTTCC CAGCGCC 2 7 

Q (2) INFORMATION FOR SEQ ID NO: 50: 

2 (i) SEQUENCE CHARACTERISTICS: 

f7 ( A ) LENGTH: 2 7 base pairs 

H (B) TYPE: nucleic acid 

yQ (C) STRANDEDNESS: single 

Vj (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
AAATCTAGAC TATTTGATGT TCTCAAT 27 
(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
AACAAAAACC TTTTTTGTTT TGAATGGCCT AGAGAGGGTA AGGGATCCCC TGACGAACAG 
GAGCAGAGCC AGCTAGAACC TGGGCCTGGC CAGTTCAAGG CCAC C AGAGG GCAGCCTTCT 



60 
120 



GCGGAAGGCA GTATTGGGGT AGGCAGGGAC CCCAGCAGAC ATGGCACTCA GAGCTCTCAC 180 

TGTCCACTGA CTCTCTCTTC TCCAGGTTAT GGCCACATGG CCCCACTATC GCCAGGCGGA 240 

AAGGCCTTCT GCATGGTCTT ATAGCCCTTG GGCTGCCAGC CTCCTTAGCT CTCGTGGCCA 3 00 

CCCTGCGCCA TTGCCTGCTG CCTGTGCTCA GCCGCCCACG TGCCTGGGTA GCGGTCCACT 360 

GGCAGCTGTC ACCGGCCAGG GCTGCGCTGC TGCAGGCAGT TGCACTGGGA CTGCTGGTGG 42 0 



CCAGCAGCTT TGTGCTGCTG CCAGCGCTGG TGCTGTGGGG CCTTCAGGGC GACTGCAGCC 480 
TGCTGGGGGC CGTCTACTTC TGCTTCAGCT CGCTCAGCAC CATTGGCCTG GGG 533 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

ATGATACGAT TTAATACGAC TCACTATAGG GAATTTGGCC CTCGAGGCCA AGAATTCGGC 60 

ACGAGGAGAA TGTGCGCACG TTGGCTCTCA TCGTGTGCAC CTTCACCTAC CTGCTGGTGG 12 0 

GCGCCGCGGT GTTCGACGCA CTGGAGTCGG AGC CGGAGAT GATCGAGCGG CAGCGGCTGG 180 

AGCTGCGGCA GCTGGAGCTG CGGGCGCGCT ACAACCTCAG CGAGGGCGGC TACGAGGAGC 24 0 

TGGAGCGCGT CGTGCTGCGC CTCAAGCCGC ACAAGGCCGG CGTGCAGTGG CGCTTCGCCG 3 00 

GCTCCTTCTA CTTCGCCATC ACCGTCATCA CCACCATCGG CTATGGTCAT GCGGCGCCCA 360 

GCACGGACGG AGGCAAGGTG TTCTGCATGT TCTACGCGCT GCTGGGCATC CCGCTCACAC 420 

TAGTCATGTT CCAGAGCCTG GGTGAACGCA TCAACACCTC CGTGAGGTAC CTGCTGCACC 480 

GTGCCAAGAG GGGGCTGGGC ATGCGGCACG CCGAAGTGTC CATGGCCAAC ATGGTGCTCA 54 0 

TCGGTTTCGT GTCGTGCATC AGCACGCTGT GCATCGGCGC AGCTGCCTTC TCCTACTACG 600 

AGCGCTGGAC TTTCTTCCAG GCCTATTACT ACTGCTTCAT CACCCTCACC ACCATCGGCT 66 0 

TCGGCGACTA TGTGGCGCTG C AG AAGG AC C AGGCGCTGCA GACGCAGCCG CAGTATGTGG 720 

CTTCAGCTTC GTGTACATCC TCACGGGCTC ACGGTCATCG GCGCTTCCTC AACCTCGTGG 780 

TGCTGCGATT CATGACCATG AACGCCGAGG ACGAGAAGCG TGATGCGGAG CACCGCGCCC 84 0 

TGCTCACGCA CAACGGCCAG GCTGTCGGCC TGGGTGGCCT GAGCTGCCTG AGCGGTAGCC 900 

TGGGCGACGG CGTGCGTCCC CGCGACCCAG TCACATGCGC TGCGGCCGCA AGCTTA 956 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

CTGAAACCAT GGGCCCGATA CCTGCTCCTG CTTATGGCCC ACCTGCTGGC CATGGGCCTT 6 0 

GGGGCTGTGG TGCTTCAGGC CCTGGAGGGC CCTCCAGCTC GCCACCTCCA GGCCCAGGTC 12 0 

CAGGCTGAAC TGGCTAGCTT CCAGGCAGAG CACAGGGCCT GCTTGCCACC TGAGGCCCTG 18 0 



GAGGAGCTGC TAGGTGCGGT CCTGAGAGCA CAGGCC CATG GAGTTTCCAG CCTGGGCAAC 24 0 

AGCTCAAGAC AAGCAACTGG GATCTGCCCT CAGCTCTGCT GTTCACTGCC AGCATCCTCA 3 00 

CCACCACCGG TTATGGCCAC ATGGCCCCAC TCTCCTCAGG TGGAAAGGCC TTCTGTGTGG 36 0 

TCTATGCAGC CCTTGGGCTG CCAGCCTCTC TAGCACTTGT GGCTGCCCTG CGCCACTGCT 42 0 

TGCTGCCTGT GTTCAGTCGC CCAGGTGACT GGGTAGC CAT TCGCTGGCAG CTGGCAC C AG 48 0 

CTCAGGCTGC TCTGCTACAG GCAGCAGGAC TGGGCCTCCT GGTGGCCTGT GTCTTCATGC 54 0 

TGCTGCCAGC ACTGGTGCTG TGGGGTGTAC AGGGTGACTG GCAGCCTGCT AAACCATCTA 600 

CTTCTGTTTC GGCTCACTCA GCACGATCGG CCTAGGAGAC TTGCTGCCTG CCCATGGACG 66 0 

TGGCCTGCAC CCAGCCATTT ACCACCTTGG GCAGTTTGCA CTTCTTGGTT ACTTGCTCCT 72 0 

GGGGCTCCTG GCCATGTTGT TAGCAGTAGA GACCTTCTCA GAGCTGCCTC AGGTCCGTGC 78 0 

CATGGTGAAA TTCTTTGGGC CCAGTGGCTC TAGAACCGAT GAAGATCAAG ATGGCATCCT 84 0 

AGGC CAAGAT GAGCTGGCTC TGAGCACTGT GCTGCCTGAC GCCCCAGTCT TGGGACCAAC 900 

CACCCCAGCC TGAGCGGGAG GCACCAAGGA GTGCTTGAAG AACATAGCAG AAGGGTTATG 960 

GGAATGAATA TGTCATGGGA TAATGTTAAT TTTAAAAATT AAATGGGCTG CTTAGCATGC 102 0 

AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA 1052 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Asn Lys Asn Leu Phe Cys Phe Glu Trp Pro Arg Glu Gly Lys Gly Ser 
15 10 15 

Pro Asp Gin Glu Glu Gin Ser Gin Leu Glu Pro Gly Pro Gly Gin Phe 
20 25 30 

Lys Ala Thr Arg Gly Gin Pro Ser Ala Glu Gly Ser lie Gly Val Gly 
35 40 45 

Arg Asp Pro Ser Arg His Gly Thr Gin Ser Ser His Cys Pro Leu Thr 
50 55 60 

Leu Ser Ser Pro Gly Tyr Gly His Met Ala Pro Leu Ser Pro Gly Gly 
65 70 75 80 




Lys Ala Phe Cys Met Val Leu Xaa Ala Leu Gly Leu Pro Ala Ser Leu 
85 90 95 

Ala Leu Val Ala Thr Leu Arg His Cys Leu Leu Pro Val Leu Ser Arg 
100 105 110 

Pro Arg Ala Trp Val Ala Val His Trp Gin Leu Ser Pro Ala Arg Ala 
115 120 125 

Ala Leu Leu Gin Ala Val A la Leu _Gly^Leu_Leu— Val— Ala— Ser— Ser— Phe 
130 ~ ~~ 135 140 



Val Leu Leu Pro Ala Leu Val Leu Trp Gly Leu Gin Gly Asp Cys Ser 
145 150 155 160 

Leu Leu Gly Ala Val Tyr Phe Cys Phe Ser Ser Leu Ser Thr lie Gly 
165 170 175 

Leu Gly 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
• (D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Gly lie Trp Pro Ser Arg Pro Arg lie Arg His Glu Glu Asn Val Arg 
15 10 15 

Thr Leu Ala Leu lie Val Cys Thr Phe Thr Tyr Leu Leu Val Gly Ala 
20 25 30 

Ala Val Phe Asp Ala Leu Glu Ser Glu Pro Glu Met lie Glu Arg Gin 
35 40 45 

Arg Leu Glu Leu Arg Gin Leu Glu Leu Arg Ala Arg Tyr Asn Leu Ser 
50 55 60 

Glu Gly Gly Tyr Glu Glu Leu Glu Arg Val Val Leu Arg Leu Lys Pro 
65 70 75 80 

His Lys Ala Gly Val Gin Trp Arg Phe Ala Gly Ser Phe Tyr Phe Ala 
85 90 95 

lie Thr Val lie Thr Thr lie Gly Tyr Gly His Ala Ala Pro Ser Thr 
100 105 110 

Asp Gly Gly Lys Val Phe Cys Met Phe Cys Met Phe Tyr Ala Leu Leu 
115 120 125 

Gly lie Pro Leu Thr Leu Val Met Phe Gin Ser Leu Gly Glu Arg lie 
130 135 140 

Asn Thr Ser Val Arg Tyr Leu Leu His Arg Ala Lys Arg Gly Leu Gly 
145 150 155 160 

Met Arg His Ala Glu Val Ser Met Ala Asn Met Val Leu lie Gly Phe 
165 170 175 

Val Ser Cys lie Ser Thr Leu Cys lie Gly Ala Ala Ala Phe Ser Tyr 




180 185 t 190 

Tyr Glu Arg Trp Thr Phe Phe Gin Ala Tyr Tyr Tyr Cys Phe lie Thr 
195 200 205 

Leu Thr Thr lie Gly Phe Gly Asp Tyr Val Ala Leu Gin Lys Asp Gin 
210 215 220 

Ala Leu Gin Thr Gin Pro Gin Tyr Val Ala Ser Ala Ser Cys Thr Ser 

_ 225 _ 23 0 . 235 240 

Ser Arg Ala His Gly His Arg Arg Phe Leu Asn Leu Val Val Leu Arg 
245 250 255 

Phe Met Thr Met Asn Ala Glu Asp Glu Lys Arg Asp Ala Glu His Arg 
260 265 270 

Ala Leu Leu Thr His Asn Gly Gin Ala Val Gly Leu Gly Gly Leu Ser 
275 280 285 

Cys Leu Ser Gly Ser Leu Gly Asp Gly Val Arg Pro Arg Asp Pro Val 
290 295 300 

Thr Cys Ala Ala Ala Ala Ser Leu 
305 310 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 04 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Leu Lys Pro Trp Ala Arg Tyr Leu Leu Leu Leu Met Ala His Leu Leu 
15 10 15 

Ala Met Gly Leu Gly Ala Val Val Leu Gin Ala Leu Glu Gly Pro Pro 
20 25 30 

Ala Arg His Leu Gin Ala Gin Val Gin Ala Glu Leu Ala Ser Phe Gin 
35 40 45 

Ala Glu His Arg Ala Cys Leu Pro Pro Glu Ala Leu Glu Glu Leu Leu 
50 55 60 

Gly Ala Val Leu Arg Ala Gin Ala His Gly Val Ser Ser Leu Gly Asn 
65 70 75 80 

Ser Ser Xaa Thr Ser Asn Trp Asp Leu Pro Ser Ala Leu Leu Phe Thr 
85 90 95 

Ala Ser lie Leu Thr Thr Thr Gly Tyr Gly His Met Ala Pro Leu Ser 
100 105 110 

Ser Gly Gly Lys Ala Phe Cys Val Val Tyr Ala Ala Leu Gly Leu Pro 
115 120 125 

Ala Ser Leu Ala Leu Val Ala Ala Leu Arg His Cys Leu Leu Pro Val 
130 135 140 

Phe Ser Arg Pro Gly Asp Trp Val Ala lie Arg Trp Gin Leu Ala Pro 
145 150 155 160 



Ala Gin Ala Ala 



Cys Val Phe Met 
180 

Asp Trp Gin Pro 
195 

Thr_Ile_ Glyi_Leu 
210 

Pro Ala lie Tyr 
225 

Leu Gly Leu Leu 



Pro Gin Val Arg 
260 

Thr Asp Glu Asp 
275 

Ser Thr Val Leu 
290 



Leu Leu Gin Ala 
165 

Leu Leu Pro Ala 



Ala Xaa Thr lie 
200 

J31y_Asp_Leu_Le_u 
215 

His Leu Gly Gin 
230 

Ala Met Leu Leu 
245 

Ala Met Val Lys 



Gin Asp Gly lie 
280 

Pro Asp Ala Pro 
295 



Ala Gly Leu Gly 
170 ^ 

Leu Val Leu Trp 
185 

Tyr Phe Cys Phe 



Pro Ala Hi s Gl y 
220 

Phe Ala Leu Leu 
235 

Ala Val Glu Thr 
250 

Phe Phe Gly Pro 
265 

Leu Gly Gin Asp 



Val Leu Gly Pro 
300 



Leu Leu Val Ala 
175 

Gly Val Gin Gly 
190 

Gly Ser Leu Ser 
205 

Arg Gly Le u His 



Gly Tyr Leu Leu 
240 

Phe Ser Glu Leu 
255 

Ser Gly Ser Arg 
270 

Glu Leu Ala Leu 
285 

Thr Thr Pro Ala 



